Example-Based Grapheme-to-Phon

Authors

  • Paisarn Charoenpornsawat
  • Tanja Schultz
Abstract

Several characteristics of the Thai writing system make Thai grapheme-to-phoneme (G2P) conversion very challenging. In this paper, we propose an Example-Based Grapheme-to-Phoneme conversion approach. It generates the pronunciation of a word by selecting, modifying, and combining pronunciations of syllables from the training corpus. The best system achieves 80.99% word accuracy and 94.19% phone accuracy, significantly outperforming previous approaches for Thai.
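
The abstract reports word accuracy and phone accuracy but does not define them. The sketch below shows one common way to compute such metrics. It assumes that word accuracy is the fraction of words whose predicted phone sequence matches the reference exactly, and that phone accuracy is one minus the total Levenshtein distance divided by the total number of reference phones. The function names `edit_distance` and `g2p_accuracy` are hypothetical, and the paper's own scoring may differ.

```python
from typing import List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def g2p_accuracy(pairs: List[Tuple[List[str], List[str]]]) -> Tuple[float, float]:
    """Return (word accuracy, phone accuracy) over (reference, hypothesis) pairs.

    Assumed definitions (not taken from the paper):
      word accuracy  = exact matches / number of words
      phone accuracy = 1 - total edit distance / total reference phones
    """
    exact = sum(1 for ref, hyp in pairs if ref == hyp)
    errors = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total_phones = sum(len(ref) for ref, _ in pairs)
    return exact / len(pairs), 1.0 - errors / total_phones


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    toy_pairs = [
        (["k", "a"], ["k", "a"]),
        (["m", "a", "j"], ["m", "a", "n"]),
    ]
    word_acc, phone_acc = g2p_accuracy(toy_pairs)
    print(f"word accuracy: {word_acc:.2f}, phone accuracy: {phone_acc:.2f}")
```

On the two toy words, one is fully correct and the other has a single substituted phone out of five reference phones in total, so under these assumed definitions the word accuracy is 0.50 and the phone accuracy is 0.80.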

Similar resources

Improving recognition of proper nouns in ASR through generating and filtering phonetic transcriptions

Accurate phonetic transcription of proper nouns can be an important resource for commercial applications that embed speech technologies, such as audio indexing and vocal phone directory lookup. However, an accurate phonetic transcription is more difficult to obtain for proper nouns than for regular words. Indeed, phonetic transcription of a proper noun depends on both the origin of the speaker pro...

Example-based grapheme-to-phoneme conversion for Thai

Several characteristics of the Thai writing system make Thai grapheme-to-phoneme (G2P) conversion very challenging. In this paper, we propose an Example-Based Grapheme-to-Phoneme conversion approach. It generates the pronunciation of a word by selecting, modifying, and combining pronunciations of syllables from the training corpus. The best system achieves 80.99% word accuracy and 94.19% phone accu...

Automatic generation of phonological variations

A recognition system must include structures which account for the different aspects of possible phonological variability in the incoming speech signal. To aid in dealing with variability at the phonological level, a grapheme-to-several-phoneme strings module, VARIONO, has been developed at LIMSI. Tests of this system showed the necessity of creating a hierarchical structure to order phonological ...

Integrating Thai grapheme based acoustic models into the ML-MIX framework - for language independent and cross-language ASR

Grapheme based speech recognition is a powerful tool for rapidly creating automatic speech recognition (ASR) systems in new languages. For purposes of language independent or cross language speech recognition it is necessary to identify similar models in the different languages involved. For phoneme based multilingual ASR systems this is usually achieved with the help of a language independent ...

Phon: Free Software for Phonological Transcription and Analysis

1. OVERVIEW. Phon is an open-source program for the transcription and analysis of phonological and phonetic data. It was designed to help systematize research in children’s phonological development, but many functions in Phon, particularly the powerful search function, can be used for a wide range of investigations in phonetics and phonology. Phon is compatible with other language processing pr...

Publication date: 2006